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Abstract 

We determine a necessary and sufficient condition for a polynomial over an algebraically closed field 
k to induce a surjective map on matrix algebras Mn{k) for n > 2. The criterion is given in terms of 
algebraic conditions on the polynomial and the proof uses simple linear algebra. Following that, we 
formulate and prove a corresponding result for entire functions as well. 


Introduction ; 

Suppose that A is a complex square matrix. The question of existence of a matrix B such that = A is 
a very well-known problem in linear algebra. It turns out that it is not possible to find such a B for every 
A. In this note we extend the above question. Let / be any polynomial over the complex numbers. Is it 
possible to find a matrix B for every A such that f{B) = A? We extend the question even further and ask 
the same for an entire function / instead of a polynomial. After we prove our results, we can deduce some 
known results as corollaries e.g., the surjectivity of the exponential map on the invertible matrices and 
a question (Picard’s theorem for matrices) originally asked by Polya and answered by Szegd, which appears 
as an exercise in [2]. 

We begin with an arbitrary algebraically closed field k. We denote the set of polynomials in one variable 
over k by k\X] and the set of n x n matrices with entries from k by M„(fc). A polynomial / € k\X] induces 
a map M„(/) : M„(fc) —)• M„(fc), where A y) aM* G M„(fc). By abuse of notaton, we denote M„(/)(A) as 
/(A), when n is understood. Let /' G k[X] denote the derivative polynomial of / and let Z{f') denote the 
set of zeros of the polynomial /'. If t G fc is such that C Z{f'), we say that t is a critical value of /. 

Theorem 1. Let n > 2 be fixed. M„(/) : M„(fc) —>■ M„(fc) is non-surjective iff there exists a t G fc such that 
C Z{f'), i.e., / has a critical value. 

Note: The algebraic condition on the polynomial / is independent of n. So either M„(/) is a surjection for 
all n > 2 or a non-surjection for all n > 2. 

Examples: 

1. Evidently, every polynomial / G k[X] of degree 1 induces a surjection on M„(fc) for any n >2. 

2. Let / G k[X] be a quadratic polynomial, i.e., / = aV^ -|- bX + c. If char k ^ 2, then Z{f') will be the 
singleton {—b/2a} and the fibre f~^{f{—b/2a)) = {—h/2a}. If char k = 2, and b — 0 then Z{f') = k. 
So applying our result in both of these cases, we see that M„(/) cannot be a surjection. If charfc = 2 
and & yf 0, then Z{f') is empty, hence M„(/) is a surjection. 

3. Let char k = p. We show that for any d > 2, there exists a polynomial / of degree d for which M„(/) 

is a surjection, li p \ d, then consider f(z) = + z. Since Z{f') is empty, it induces a surjection. If 


I 


p does not divide d but p \ d — 1, then let f{z) = z'^ + z‘^~^. Since Z{f') = 0 and {—1} S /“^(O), it 
follows that / induces surjection. The only remaining case is gcd(p, d{d — 1)) = 1. Let f{z) = z‘^ — dz. 
Since p does not divide d— 1, f'{z) has d — 1 distinct roots {Ci, ■ ■ ■, Cd-i}- Let there be a t G fc such 
that C Z{f'). Then for some 1 < r < d — 1, is a root of the polynomial h{z) = f{z) — t. If 

the multiplicity of in h{z) is at least 3, the multiplicity of C,r in h'{z) = f’{z) is at least 2. But this 
is a contradiction since f'{z) has d — 1 distinct roots. So the multiplicity of Cr in h{z) is at most 2. 
Since deg(d(z)) = d > 2, there exists another root of h{z). So there exists some l<r 7 ^s<d—1 such 
that is a root of h{z). But then t = /(Cr) = f{Cs), which imples (1 — d)(r = (1 — d,)(s, contradicting 
the fact that Cr ^ Cs- Hence there cannot be any t € k such that C Z{f'). Therefore / induces 

a surjection. 

We recall some definitions before going into the proof of Theorem 1. Recall that a Jordan matrix J is a 

block diagonal matrix 
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The following facts are quite well-known and their proofs of can be found in [3] for example. 

1. If A: is algebraically closed, any matrix A G Mn,(A:) is similar to a Jordan matrix J. The matrix J is 
said to be the Jordan normal form of A. 

2. The Jordan normal form of a matrix is unique up to a permutation of the diagonal blocks. 

3. Ai’s are eigenvalues of A and number of Jordan blocks corresponding to the eigenvalue Xi in the Jordan 
normal form of A is the dimension of the Kernel of {A — \i ■ I). This is a direct consequence of the 
rank-nulity theorem. 

Now we prove two lemmas which will be used to prove the theorem. 

Lemma 1. Let U he a Jordan block with A as its eigenvalue. Let p{X) = X]m=o ^ k[X]. Then, 

p{U)ij = 0 for i > j and 

otherwise. (Here we follow the convention that = 0 for g > p.) 

Proof. Observe that U = XI + N where Nij = 1 if j — z = I, and Wj = 0 otherwise. By linearity, it suffices 
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to prove the lemma in the case of p{X) = , which is done below. 

U^^ = {\I + N)% 



Now, ’’ is nonzero, only when j 
Hence the sum equals. 


as asserted. 


i = k - r, or r = k - {j - i). 
U - i)J \j - V 


Note: Irrespective of the characteristic of k, ii j — i 
p{U)ij = for all 1 < z < j < n 


1 , we have p{U)ij = p'{X). If char fc = 0, then 


Lemma 2. Let n > 2. It U € M„(fc) is a Jordan block with A as its eigenvalue and p{X) € k[X], then the 
Jordan normal form of p{U) has at least two Jordan blocks if and only if p'{\) = 0. 

Proof. To prove this lemma, we use the third fact noted earlier. By Lemma 1, p{X) is the only eigenvalue of 
p{U) and number of blocks is equal to the dimension of Ker(P([/) — p{X) ■ I). So if p'{X) = 0, then the first 
two columns of P{U) — p{X) ■ I are zero by Lemma 1. Hence the rank is at most n — 2. By the rank-nullity 
theorem, Ker(P([/) — p{X) ■ I) >2. 

Conversely, if p'{X) ^ 0, the matrix P(U) — p{X) ■ I has n — 1 linearly independent columns. Indeed, by 
Lemma I, the first column ci of P{U) — p{X) ■ I is zero and the fc-th column (for 2 < fc < n) is 

..»)■ 

Since p'(A) 7 ^ 0, all the n—1 vectors Cfc, for 2 <k< n are linearly independent. So dimKer (P(P) — p{X) ■ I) = 
1, and p{U) has only one Jordan block. 

Proof of Theorem 1. Let t be a critical value of /. We take Y S Mn(k) such that F is a Jordan block 
with t as eigenvalue and show that Y is not in the image of Mn{f). Assume that f{X') = Y. Let X be the 
Jordan normal form of X'. So f{X) is similar to Y (using f{PAP~^) = Pf{A)P~^) . Since F is a Jordan 
block, X also has to be a Jordan block. Otherwise Jordan normal form of ,f{X) will not be a single Jordan 
block and hence cannot be similar to F. Let u be the eigenvalue of X. Hence f{u) is the only eigenvalue 
of f{X) (by Lemma 1). Since f{X) is similar to F, there eigenvalues have to be the same. So f{u) = t 
u € C Z{f'). Hence f'{u) = 0. But then by Lemma 2, Jordan normal form of f{X) has more 

than one Jordan block. Therefore, it cannot be similar to a single Jordan block F. 

Before proving the converse, we prove two more lemmas which will be useful later. 

Lemma 3. Let r > 1 and F G Mr(fc) be a Jordan block with eigenvalue A such that A is not a critical value 
of /. Then there exists a Jordan block X in Mj.{k) such that Jordan form of f{X) = Y. 

Proof. For r = 1, the result is clear, since k is algebraically closed. So we assume that r > 2 in what follows. 
Since f~^{X) Z{f'), there exists u € k such that f{u) = X and f'{u) ^ 0. Let X € Mr{k) be the Jordan 

block with eigenvalue u. By Lemma 2, Jordan form of f{X) is a Jordan block of order r and has eigenvalue 
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A, so it has to be equal to V. 


Lemma 4. Let B G Mn(k) such that the eigenvalues of B are not critical values of /. Then B lies in the 
image of 

Proof. Let V be the Jordan normal form of B. Then Y = diag(Yi,... ,Yp) is a Jordan matrix, where l^’s 
are Jordan blocks. Using the hypothesis on eigenvalues of B and Lemma 3, there exists a Jordan block Xi 
such that Jordan form of f{Xi) = Yi. So Jordan form of diag(/(Ari),..., f{Xp)) = Y. Now consider the 
matrix X = diag(Jfi,..., Xp). Since 


/(^) = /(diag(Xi,..., Xp)) = diag(/(Xi),..., f{Xp)), 


the Jordan form of f{X) is equal to Y. In particular f{X) is similar to Y and consequently to B^ which 
implies that B is in the image of M„(/) (using f{PAP~^) = Pf{A)P~^ ). 

Therefore, if / has no critical values. Lemma 4 implies that Mn{f) is a surjection. This proves the 
converse and finishes the proof of Theorem 1. 

We have established the following in the first part of the proof of Theorem 1, which we note down 
explicitely as a lemma for later use: 

Lemma 5. If t is a critical value of / and Y G Mn(k) is a Joran block with eigenvalue t, then Y does not 
belong to the image of M„(/). 

Matrices of entire functions: 

A function / : C —>■ C is said to be entire if it holomorphic on the whole of C. Given an entire function, we 
write it as a power series around zero : f{z) = For an entire function /, f{A) := 

is a well-defined matrix. The reader may refer to [I] for its proof. So sending A to f{A) gives a map 
M„(/) : M„(C) —>■ M„(C). Let /' be the derivative of / and let Z{f') denote the zeros of the /'. If t G C is 
such that f~^{t) C Z{f'), we say that t is a critical value of /. Note that if t 0 /(C), t is a critical value of 
/• 

We define Dn{f) to be the set of all matrices in Mn{C) whose eigenvalues lie in /(C). Lemma I easily 
extends to the case of entire functions. In particular, if Ai,...,A„ are eigenvalues of A G M„(C), then 
/(Ai),... ,/(An) are the eigenvalues of f{A). So Im(M„(/)) C D{f). Since f{PAP~^) = Pf{A)P~^ and 
Dn{f) is closed under conjugation, the discussion for the case of polynomials applies mutatis mutandis 
to entire functions and yields the corresponding versions of all our lemmas. We also obtain the following 
theorem: 

Theorem 2. Let n > 2 be a fixed. Then Im(M„(/)) ^ Dn{f) iff there exists a t G /(C) such that 

r\t) c z{n- 

Note: By Picard’s little theorem [^, if / is a non-zero entire function then at most one complex number 
does not belong to the image of /. So Dn{f) is either M„(C) or the set of all matrices whose eigenvalues 
does not equal to pf for some fixed complex number pf. 

Examples. 

1. Let f(z) = exp(z). Then /(C) = C*. Hence Dnif) = GL„(C), the set of n x n invertible matrices over 
C. Since Z{f') = Z{f) = 0, the matrix exponential is a surjective map from M„(C) —>■ GL„(C). 

2. sinz and cosz are surjective entire functions. But the maps they induce from M„(C) —^ M„(C) are 
not surjective for n > 2. Indeed, we have sin^ z -|- cos^ z = 1. So sin“^({±I}) C Z(cosz) and 
cos“^({±I}) C Z(sinz). Hence by Lemma 5 (which remains true for entire functions, as noted in the 
previous discussion), Jordan blocks with eigenvalue I and —I are not in the image of M„(sin(z)) or 
M„(cos(z)). 
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3. Picard’s theorem for matrices [2] : Let Cf denote the set of all critical values of an entire 
function /. As a corrollary to the Second Fundamental Theorem of Nevanlinna, one obtains that 
I (7/1 < 2. The reader may refer to [1] for an exposition on Nevanlinna theory, which also contains the 
stated corrollary. Now by using Lemma 4 and Lemma 5 (both of them extend to the case of entire 
functions, as already noted) we obtain that an entire function / has at most 2 ’’exceptional values” 
in the following special sense: A G M„(C) lies in the image of M„(/) if none of the eignevlaues of A 
coincides with an exceptional value of /. On the other hand, there are certain matrices with eigenvalues 
consisiting of exceptional values not belonging to the image of M„(/). 
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